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Amendmepts to Claims 

Claims 1-34 Cancelled 

35. (Currently amended) A method of discovering one or more patterns in a set 
of k sequences of symbols, called a k-tupJe, where k is greater than or equal to two, 
within an overall set of w sequences having sequence numbers 1 , 2, . . w , the symbols 
being members of an alph abet, each sequence of symbols having respective lengths Li, 
L2, . - Lws comprising the steps of: 

a) translating the sequences of symbols into a table of ordered (symbol, posi tion 
index) pairs, where the position index refers to the location of the symbol in a sequence; 

b) for each of the w sequences, grouping the (symbol, position in dex) pairs by 
symbol to form a respective master offset table, thus creating w master offset tables; 

c) using tbe position indices in the w master offset tables to determine_the 
difference-in-position value between each occuirence of a symbol in one of the sequences 
and each occurrence of that same symbol in the otfier sequence in. each master offset table 
, fomiing a k-tuple table associated with the k-tuple, the table comprising k columns, one 
of tlie k colunans being a primary column and the remaining (k-1) columns being sxiflix 
columns, each column corresponding to one of the k sequences; 

i) the primary coliunn comprising the (symbol, position index) 
pairs of a primary sequence, 

ii) the (k-1) suffix columns comprising (symbol, difference-in- 
position value) pairs, where the difference-in-position values are the 
position differences between all like same symbols of each remaining 
sequence of the tuple and the primary sequence of the tuple, 

iii) the rows in the k-tuple table resulting from forming all 
combinations of like same symbols from each sequence; 

d) creating a sorted k-tuple table by perfomiing a multi-key sort on the k-tuple 
table, tlie sort keys being selected respectively from the difference-in-positjon values of 
the last suffix column (k*^ column) through the difference-in-position value of tbe first 
suflSx column; an^ 

e) defining arset^ one or more patterns by collecting adjacent rows of the sorted 
k-tuplc table whose suffix columns contain identical difference-in-position values, tbe 
relati ve positions of the symbols in each pattern being determined by the primary column 
position indices, the set^ one or jnore patterns being common to the k sequences. 

36. (Original) Themethodof claim 35 further comprising: 

f) deleting all patterns not satisfying a predetermined criteria. 

37. (Original) The method of claim 35 further comprising: 

f) deleting all pattems shorter than a first predetermined span and longer than a 
second predetermined span. 

38. (Original) The method of claim 35 further comprising: 

f) deleting all pattems having fewer than a predetermined number of symbols. 
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39. (Origjoal) The method ofclaim 35, further comprising the stqj of dele^^^^^ 
rows fix)Tti the k-tuple table which do not have sufHx indices identica] to any other row of 
the k-tuple table. 

40. (Original) TTie method of claim 35 further comprising the step of deleting 
rows &om the k-tuple table according to predetermined criteria. 

4h (Previously amended) Themethodof claim 40, wherein rows sharing 
identical suffix colmnn diiference-in-position values are deleted fix>m the k-tuple table if 
there are fewer than Ns such rows, where Ns is the minimum nimiber of symbols per 
pattern. 

42. (Currently amended) A method of discovering one or more patterns In a set 
of k+1 sequences of sy mbols, called a fk+lVtuole. where k is greater than or equal to 

two, wit hin an overall set of w sequences having sequence numbers 1.2 w , the 

symbols being mem bers of an alphabet each sequence of symbols having rest)ective 

lengths Lu L? L^. by first forming a k-tunle table and then fonning a (k+l)-tuple table 

by combining the r wboroin a k-tuple table is combined with a .aa_add'itional sequence 
of symbols, the fottnation of the k-tuple table comprising the stepj^ ftf: 

a) translating the sequences of symbols into a table of ordered (symbol, position 
index) pairs, where th e position index refers to the location of ^e symbol in a sequence: 

b) -.for .each of the w sequences, jgroupmg tlie (symbol, position index) pairs bv 
symbol to form a respective master oflfeet table, thus creating w master effect tables: 

c) using the position indices in the w master offset tables to determine th^ 
difference-in-position value betwee n each occurrence of a symbol in one of the sequences 
and each o ccurrence of that same symbol in the other sequence in each master offset table 
forming a k-niple tab le associated with the k-tuple, the table comprising k columns, one 
of the k co lunans being a primary column and the remaining flc-ll columns beine sufRx 
colum ns, each column corresponding to one of the k sequences: 

i) the primary column co mprising the (symbol, position index) 
pairs of a primary sequence, 

ii) the (k-l) sufSx colu mns comtmsing (symbol, difference-in- 
t>o$ition value) p airs, where the difiference-in-position values are tl; ie 
position differences between all iitee same symbols of each remaininp 
sequence of the t uple and the primary sequence of the tuple, 

iii) the row s in the k-tuple table resultmg from forming all 
combinations of titee same symbols from each sequence: 

d) _creating a sorted k-tuple table bv performing a multi-kev son on the k-tup le 
table, the sort keys being selected re spectively from the difference-in-position values of 
the last suf^Sx column (k*^column) through the difference-in-position value of the firet 
suffix column: 

the formation of the (k+D-tuple table comprising the step s of: 
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e) e} translating the additional sequence of symbols into a table of ordered (symbol, 
position index) pairs, where the position index refers to the location of the symbol in the 
additionat sectuence of symbols; 

b) £l grouping the (symbol, position index) pairs by symbol to form a master 
offset table; 

0) g) creating the (k+l)-tuple table of k+1 columns, one of the k+l columns being 
a primary column and the remaining k columns being suffix columns, by: 

i) forming all combinations of tike same symbols between the 
primary column of the k-tuple table and the master ofl&et table, 

ii) for each such combination, duplicating the correspondii^g row of 
the k-tuple table, and appending a (symbol, di£ference-in-position value) pair 
corresponding to the dijeference between the position index of the master 
offset table and the position index of the primary column. ^ 

h) creating a sorted fk-H VtunleJable bv nerfoiming a multi-kev sort_on the fk+1 V 
tuple tabl e, the sort keys being selected respecti vely from the difiFcrence-in-position 
values of the last suffix column fOcH -1 mhimri l through the difference^n-nosition value 
of tl^ie jgrst sufSx column; and 

ildefining one or more patterns bv collecting adjacent rows of the sorted nc4-l)- 
tuple table whose suffix colmnns contain identical difference-in-position values, the 
relative positions of tl ie symbols in each pattern being detemiined bv the primary column 
position in dices, the one or more patterns being conunon to the kH-1 sequences . 

43. (Currently amended) Hie method of claim 42 further comprising the step of; 
deleting patterns from a k-tuple table common to the k-tuple table and a (k-M)- 

tuple table, where the (k+l)-tuple table contains all of the sequences of the k-tuple table 
with one additional sequence, by: 

a) deleting the suffix column corresponding to a sequence not shared between tlae 
two tuple tables, thereby defining a modified table, and 

b) deleting alj rows from the k-tuplo table whose suffix colunms alj contain 
identical sets of difiference-in-position values to a row of the modified table. 

44. (Currently amended) A method of discovering one or more patterns in a set 
of k sequences of symbols, called a k-tuple, comprising the steps of: 

a) for a first pair of sequences of the k-tuple 

i) translating each sequence of symbols into a table of ordered 
(symbol, position index) pairs, where the position index of each (symbol, 
position index) pair refers to the location of the symbol in the sequence; 

ii) for each of the paired sequences, grouping the (symboC position 
index) pairs by symbol to respectively form a first master ofifset table and a 
second master offset table; 

iii) forming a Pattern comprising an array having (LI + L2 - 
1) rows by: 

A) subtracting the position index of the first master 
offset table from the position index of the second master 
offset table for every combination of (symbol, position 
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index) pair having like same symbols, the difrerencc^in; 
position value resulting from each subtraction defining a 
row index; 

B) storing each (symbol, position index) pair from 
the first master offeet table in a row of the Pattern Map, the 
row being defined by the row index, until all (symbol, 
position index) pairs have been stored in the Pattern Map; 

iv) defining a parent pattern by collecting symbols having the 
identical diffgrencc-m-position value j&om ^ch row of the Pattern Map 
and populating an output array with the collected symbols of ooob 
(gymboli position indox) poir - ofa - row of th e Pattern Map > the symbols 
being placed at relative locatiotis in the parent pattern indicated by the 
position index of the (symbol, position index) pair; an d 

v) repeating step iv) for each row of the Pattern Map; 

b) storing the discovered patterns as arrays of (symbol, position index) pairs; 

c) for each subsequent pair of sequences of the k-tuple, replacmg the (symbol, 
position index) pairs of the first sequence of the pair of sequences by the (symbol, 
position index) pairs of the stored patterns; and 

d) repeating steps (a) through (c) for each subsequent pair of sequences until the 
k-th sequence of the k-tuple is readied. 

45. (Currently amended) The method ofclaim 35, further comprising the step of 
finding all patterns at all levels of support wi thin a set of sequences by: 

f) forming a tree of nodes, where each node corresponds to each combination of k 
sequences, and therefore represents a k-tuple> and wherein each node representing a k- 
tuple is connected to all nodes representing (k-*-l)-tuples, 

each (k+l)-tuple being formed by adding a unique sequence to the k-tuple, where 
the sequence being added is later in the ordered list of sequences than th e latest sequence 
of the k-tuple; 

g) traversing the tree, and at each node visited during traversal, defining frs e tof 
one or more patterns by collecting adjacent rows of the sorted k-tuple table whose suffix 
columns contain identical sets of difference-in-position values, the relative positions of 
the symbols in each pattern being determined by the primary column position indices, the 
6et<>f one_or.more patterns beitig common to the k sequences; 

46. (Original) The method ofclaim 45, wherein the traversal of the tree of nodes 
is accomplished via recursion. 

47. (Previously amended) The method ofclaim 45, further comprising the step 

of: 

h) removing duplicate patterns at each level of support. 

48. (Previously amended) The method ofclaim 47, wherein the removal of 
duplicate pattems at each level of support step h) is accomplished by; 
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i) for each node corresponding to a (k+l)-tuple, identifyjj>g the nodes 
containing k-tuples whose sequences are subsets of the (k+l)-tuple; thereby 
defitxing a set of causallyHjcpendent nodes; 

ii) locating said causally-dependent nodes; 

iii) removing from each said causally-dependent node the patterns in 
common with the (k+l>tuple; and 

iv) if the k-tuple table hx a causally-dependent node is thereby reduced to 
zero lengthy jcemoving the corresponding node and all of its descendents from the 
tree prior to their traversal. 

49- (Previously amended) The method ofclaina 48, wherisin locating causally- 
dependent nodes in step ii) comprises the steps of: 

(A) organizing the nodes at level k in the Tuple-tree into a linked 
hst which is ordered from left to right in accordance with the sequence 
numbers represented by each tuple; and 

(B) searching said linked list for nodes which are causally- 
dependent on a particular (k+l)-tuple, 

50. (Original) The method of claim 48, wherein the nodes located in step ii) are 
causa] ly-dependent nodes at level k determined with respect to another node at level k, 
and are thus causally-depen dent on a child of th e another node at level k. 

5 1 . (Previ ously amended) The method of claim 47, wherein the removal of 
duplicate patterns at each level of support step h) comprises the steps of: 

i) organizing the nodes at level k in the Tuple-tree into a linked list which 
is ordered J&om left to riglit in accordance wi th the sequence numbers of each 
tuple; 

ii) for each pattern in the current node at level k, storing a "hit list" array 
of the sequence numbers of the sequences containing the pattero ; 

iii) for all nodes to the right of the current node whose sequence numbers arc 
all in die hit list, searching for a duplicate instance of the pattern, and if found, 
eliminating it; and 

iv) makinig each node the current node, repeating steps (ii) and (iii), in the 
order of the node's appearance in the linked list. 

52. (Previously amended) The method ofclaim 51, wherein, in step iii), the 
nodes consistent with the liit list are found using a hash tree, the hash tree having a root 
and k levels of nodes, the k-th level of the hash tree having a plurality of leaf nodes, the 
respective level of nodes of the hash tree corresponding to the respective sequence 
numbers of a k-tuple, the leaf nodes identifying the k-tuple whose sequence numbers 
correspond to the path from the root to th e leaf node, wherein 

searching due nodes for pattern duplicates is performed by repeating steps ii) and 
iii) for each node in the order of the appearance of that node in the hash tree. 
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53. (Previously amended) The method ofclaitn 45 wberein the traversiiig step c) 
itself comprises the stqjs of: 

i) creating a Virtual Sequence Array of patterns found within the sequences, 
wherein the patterns are termed P-nodes and the tuple nodes are temed T-nodes, 

(ix) finding a P-node list corresponding to the location of each pattem in the 
primary sequence of that tree node, 

iii) searchin g th e P-node list for a duplicate instance of the pattem, 

(A) if no duplicate is found: 

(1) adding a poin ter to the pattem of the current T-node 
pattem array, 

(2) finding all locations of the pattem within the Virtual 
Sequence Array, 

(3) adding a pointer to the pattem to each corresponding P- 
node array; 

(B) if a duplicate pattern is found: 

(1) ignoring the pattem if the duplicate pattem was found at 
support equal to the current level of support, 

(2) if the duplicate pattem was found at a previous level of 
support, unlinking the duplicate pattem from its previous T-node (if 
it exists), and relinking the duplicate pattern to the current T-node, 

(3) repeating steps 1) and 2) until all of the children of a T- 
node have been created, thus insuring that patterns of that T-node 
that are at their ultimate level of support are reported^ and 

(4) deleting the T-node. 

Claims 54-<>5 Cancelled 

66. (Previously amended) A computer-readable medium containing a plurality of 
data structures useful in coDtrolJing a computer system to discover o-set"^ one.orjnore 
patterns in k sequences of symbols within an overall set of w sequences, the plurality of 
data_structures comnrisinfl: 

a number w of master offset table data stmctures each grouping, 

for each value of a difference in position between each occurrence of a 
symbol in one of th e sequen ces and each occurrence of that same sytnbol in 
each other sequence, 

the position (position index) in the first sequence of each symbol 
therein that appears in each of the other sequences at that difference-in- 
position value; 

a first-additional k-tuple table data structure coxnprising columns and rows, the 
columns comprising (symbol, position index) pairs and (symbol, difiference-in-position 
value) pairs; and 

a ge oond - odditiongrf sorted_k-tuple_table data stmcture comprising a row-sorted 
representation of ttoe (symbol, position index) pairs and (symbol, difference-in-position 
value) pairs contained in the first odditional k-tunle table data structure ^ 
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wherem adjacent rows of the sorted k- tu ple table data structu re whose suffix 
columns contain idgotical difTerence -in-position values 

define one or more patterns of symbols^ the relative TOsitions of symbo ls in each pattern 
bein2 detemimed bv the primary column position indic es in the sorted k-tuple table data 
stmcture - 

67. (Currently amended) The computer-readable medium of claim 66 wherein 
the fi ooond additional sorted k-tuple table data structure further groups, for each value of a 
difference in position, the number of symbols in the first sequence that appear in the 
second sequence at that difference-in-position value. 

68. (Currently Amended) A computer-readable medium containing instructions 
for controlling a computer system to discover one or more patterns in a set of k sequences 
of symbols, called a k-tuple, where k is greater than or equal to two, within an overall set 
of w sequences having sequence numbers 1 , 2, . . w , the symbols being members of an 
alphabet, each sequence of symbols having respective lengths Li, L2, ♦ - Lw, by executing 
a method comprising the steps of: 

a) translating the sequences of symbols into a table of ordered (symbol, position 
index) pairs, where the position index refers to the location of the symbol in a sequence; 

b) for each of the w sequences, grouping the (symbol, position index) pairs by 
symbol to form a respective master offeet table, thus creating w master offset tables; 

c) using the position indices in the w master of&et tables to determine the 
difFerence-in-position value between each occurrence of a symbol in one of the sequences 
and each_occuirence of that same symbol, in the other sequence in each master offset table 
, forming a k-tuple table associated with the k-tuple, the table comprising k columns, one 
of the k columns beiixg a primary colunm and the remaining (k-1) columns being suffix 
column s, each column corresponding to one of the k sequences; 

i) the primary column comprising the (symbol, position index) 
pairs of a primary sequence, 

ii) the (Ic-l) suffix columns comprising (symbol, difference-in- 
position value) pairs, where the difference-in-position values are the 
position differences, between all like same symbols of each remaining 
sequence of the tuple and the primary sequence of tlie tuple, 

iii) the rows in the k-tuple table resulting fiom forming all 
combinations of like same symbols fiom each sequence; 

d) creating a sorted k-tuple table by perfonning a multi-key sort on the k-tuple 
table, the sort keys being selected respectively from the difference-in-position values of 
the last suffix column (k^^ column) through the difference-in-position value of the first 
suffix column; and 

e) defining a set^ one or more patterns by collecting adjacent rows of the sorted 
k-tuple table whose sufSx columns contain identical difference-in-position values, the 
relative positions of the symbols in each pattern being determined by the primary column 
position indices, the $et-ef one .or_more patterns being common to the k sequences. 
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